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ABSTRACT 

A descriptive study was conducted to delineate a 
structured representation of one abstract type (theoretical) and 
suggest how this predictable representation could affect user-system 
interfaces of information '"'^trieval systems. Fifty-five argumentative 
type abstracts were selected from issues of "International Political 
Science Abstracts." Each abstract was classified into one of three 
groupings: Substantive, Authoritative, or Motivational. Each abstract 
was exami'*»ed for the presence of the first and the second triads of 
elements — i.e. (1) data, warrant, and claim; and (2) backing, 
rebuttal, and qualifier. Of the abstracts, 68% belonged to the 
Substantive class, 20% to the Authoritative class, and 11% to the 
Motivational class. The first triad of elements was found in all of 
the abstracts. Of the second triad of elements. Rebuttal was found in 
16, Qualifier in 24, and Backing in 28 of the abstracts. It is 
concluded that patterns of form and structure can be found in the 
argumentation texts, and therefore, that the rhetorical structure of 
arc 'ment is a useful framework in which to describe the structure of 
theoretical abstracts. Further, if a user-system interface could make 
use of these findings in information retrieval systems, users would 
be aidea in comprehending the content of retrieved abstracts and in 
refining their .search queries. (8 references) (SD) 
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INTROpyCTION 

Information retrieval is concerned with, finding those few docu- 
ments in a system which best fit the users' current information 
need* As such^ the retrieval process is concerned with two types 
of representations: statement of the users' information needs^ 
problems or queries^ and the documents or their surrogates stored 
in the retrieval system. Performance of the system could be 
upgraded by improving either of these representations. 

There is much in the literature on users' problem statements 
(Taylor, 1968; MacMullin & Taylor, 1984; Belkin, Oddy & Brooks, 
1982), to suggest that expecting more of users' statements is not 
realistic. As Taylor pointed cut in his seminal paper (1968), 
having users describe what it is they need, when t:hey themselves 
are hardly articulate about ^hat it is they don't yet know, is 
putting quite a strain on the communication aspect of the 
retrieval process. 

At the same time, document representation techniques have 
advanced very little since the last major innovation of making 
the natural language portion of docviments, usually abstracts, 
available for free text searching. This is not to say that more 
could not be done to improve the docximent representations, but 
until fairly recently, the reliance within information retrieval 
on statistical techniques of . content representation showed few 
opportunities for substantive improvement. In current retrieval 
systems, a variety of term frequency measures are used in an 
attempt to determine which documents might be more 'about' a 
user's topic of interest. Use of linguistic techniques, intui- 
tively the most appropriate approach, has been limited to lower 
levels of linguistic analysis, namely morphology (stemming) and 
syntax (noun phrase identification). Recently, the feasibility 
of applying higher levels of linguistic processing, such as dis- 
course analysis, for the improvement of document representation 
has been recognized. It may be possible for this new document 
representation to be of assistance to the user in increasing the 
understanding and statement of the information need. 

In this paper, we report on efforts to delineate a structured 
representation of one abstract type and suggest how this improved 
representation may affect the user-system interface of retrieval 
systems. The impetus for this investigation was the highly 
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encouraging results found by Liddy {1988) in describing the 
discourse -level structure of empirical abstracts. In the hope of 
extending this general approach to a quite distinct type of 
abstract^ we sought to discover whether abstracts reporting on 
theoretical work have a predictable set of information components 
and whether these occur in any predictable ordering within 
abstracts . 

DISCOURSE 21NALYSIS 

Since documents ' and abstracts are texts ^ they are amenable to 
discourse analysis^ the level of linguistics which is concerned 
with how units of language larger than a sentence function. Dis- 
course linguistics is concerned with how these texts communicate ^ 
both through their structural organization and through the mean- 
ing imparted by sentences interpreted as a text^ rather than 
singly. One major aspect of discourse linguistics is the investi- 
gation of the nature of the implicit structural organization of 
information in texts of different types. This is referred to as 
either text-level or discourse-level structure. It can be thought 
of as a superstructural organization of semantic content. How- 
ever^ it is not devoid of meaning^ for the structure itself 
implicitly communicates semantically. For example, the simple 
fact that a sentence is the last line or a story communicates 
that what it contains is to be interpreted as the end of the sto- 
ry. 

The theory of discourse lincniistics suggests that texts which 
serve a common purpose among a community of users eventually take 
on a rather predictable structure and organization. Readers of 
office memos, city ordinances, academic course descriptions, and 
even obituaries could all attest to this fact. 

STRUCTURE AND FORM 

The overriding concern here is a search for the structure and 
form of the rhetorical reasoning process that can assist in the 
understanding of the nature of argumentation as it appears in 
theoretical abstracts found in information retrieval systems. 
Conventional argumentation theorists have been influenced by, and 
have rooted their work in classical Aristotelean rhetorical 
proof. (Anderson & Dovre, 1968, p. 235). The essence of this 
form differentiates between the treatment of probabilities 
(enthemene) and that of certainties (syllogism). 

A modern incarnation of this classical . form is found in the work 
of McBurney and Mills (1968) who consider argumentation to be 
primarily concerned with levels of certainty. This orientation 
led the authors to develop a structure for analysis based on the 
inductive-deductive frame of reference. According to this model, 
argument is seen as a ranr,e of probabilities i.e. •possible, 
probable, plausible or certainly true'. (McBurney & Mills, 1968 
p. 240) The authors also make the case for the inductive process 
as a means of getting at deductive argument. Here, induction is 
used as a linkage in the reasoning sequence to get at deductive 
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argtiraents. Utilizing this philosophical base as a point of 
departure^ McBurney and Mills developed a classification of argu- 
ments based on the logical relationship between the premise and 
conclusion* The forms of argviment they identify aret cause^ sign^ 
example and analogy* 

Yet another approach to argument understanding has been posited 
by Brockriede and Ehninger (1968)* Employing the conceptual 
principles developed by Toulmin (1964)^ these authors have 
devised a schema for classifying forms of argument* In addition 
to emphasizing the logical relationships between parts of an 
argument^ Zhe schema adds a new dynamic to argument understanding 
theory by incorporating logical (substantive) and non- 
logical (authoritative^ motivational) dimensions to the classifi- 
catory design. The Brockriede and Ehninger interpretation of the 
Toulmin model is useful for two main reasons in that it (a) pro- 
vides a structxxre for analysis and criticism of rhetorical argu- 
ments and (b) suggests a typology for classifying arguments 
based on rhetorical proofs. 

Toulmin' s Construct 

It is useful to elaborate the structure for argument analysis 
developed by Toulmin (1964). This model provides a two-tiered 
analyses of argument described as * movement from accepted data 
through a warrant to a claim* * The first tier consists of data, 
warrant and claim. 

Data - consist of facts that reflect events, statistical data, 
citations from authority etc. If data do not exist, an essential 
component of argument is missing, because there is no factual 
base and the argument fails to inform. 

Claim - is the substance of the idea conveyed in an argument. 
It has been referred to as the main proof line by Brockriede & 
Enginger (1968). The claim can ocour either as the final state- 
ment in an argument or it could be some midway position in an 
argument. The usual order is data first then claim. In this 
order, claim connotes 'therefore*. The reverse order infers 'be- 
cause * . 

Warrant - is the description of that component of the argument 
that moves the argument from data to c?.aim. It certifies the 
proposition in the' claim statement so that it becomes acceptable. 

In addition to this first triadi data, warrant and claim, Toul- 
min' s construct contains a second set of components of which all 
or some may exist in an argument. These are termedt backing, 
rebuttal and qualifier. 

A backing is made up of measures or credentials designed to cer- 
tify the beliefs of the warrant. This can be a single i*-em or an 
entire argument within itself, complete with the first level 
triad of data, claim and warrant. 
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The rebuttal acts as a safety mechanism and is often attached to 
the claim statement* The rebuttal recognizes the cases in which 
the claim will be constrained^ is invalid or is in some way in 
need of a qualifier. The rebuttal places limitations on the scope 
of the validity of a claim by foreseeing possible objections* 

The role of the qualifier is to determine the strength of an 
argument* It does this by quantifying terms (e.g. ^probably*) or^ 
by pointing out possible fallacies. When an argument is thought 
to be axiomatic and cannot be disputed^ no qualifier is attached. 

Classification Approaches 

Brockriede and Ehninger devised their classification system based 
on the attributes of rhetorical proof. The first type of argu- 
ment^ Substantive^ is pictured as data which progress through to 
claim based on beliefs about things in the external world. The 
second/ Author i tat ive^ is grounded in structures about the quali- 
ty of the source from which the data are derived. The thirds 
Motivational/ is based upon assumptions related to the emotive 
state/ e.g. the ambitions, inner drives etc./ of those who hear 
the argument. The assumptions of each class and its relationship 
to the two-level triad of elements for argument analysis are 
examined in turn. 

1. Substantive Argviments - The substantive argument permits 
the discourse to move from from data to claim and is rooted 
in assumptions about the logical relationships that exist 
in the external world. There are six possible orderings of 
of this typ^a of argument: 

a. Cause - In this type of argument/ data comprise 
facts about an event/ person / object or situation. 
The warrant provides the source of power for these 
facts and tells what effects they will have while 
claim relates these effects to the d{ :a. 

b. Sign - The data are made up of a set of symptoms. 
The warrant assigns appropriate meaning to these 
symptoms and claim explains the objects/ people or 
situations that have these symptoms. 

c. Generalization - The data are made up of information 
about objects/ persons, events or conditions repre- 
sentative of given items in a class of things. The 
warrant posits that this representative sample will 
extend to the whole population^ The claim clarifies 
the underlying assumptions of the warrant. 

d. Parallel Case - The data are made up of one or more 
statements about a single object/ event or condition. 
The warrant states that the case reported in the data 
resembles a second instance of the same category. The 
claim assumes that the new case can be likened to the 
first. A rebuttal in a parallel case argument 
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obtains when (a) another parallel case has a strong 
similarity to the existing case or (b) some strong 
dissimilarity disaffirms the warrant* 

e. Analogy - The data report that a relationship of a 
certain nature exists between a pair of items* The 
claim amplifies the relationship implied in the war- 
rant* The distinction between argument from parallel 
case and argument from analogy is that the former 
assumes resemblance between two cases while the lat- 
ter assumes a similarity of relationship. Frequently^ 
the relationship expressed in this type of argument 
requires the qualifier 'possibly'. 

f. Classification - In this type of argument^ the data 
reflect a generalized conclusion about known members 
of a class. The warrant assumes that the properties 
of the items under consideration can be extended to 
those items of the class which have yet to be exam- 
ined. The claim gives to specific items the charac- 
teristics of the general statement • The provisos to 
argument from classification are (a) a class member 
may not share the special characteristics of the 
class specified in the data but may have enough other 
characteristics to justify membership in that class 
and (b) there may be instances when a givan class 
member may not share the attributes of a class. 

2. Authoritative Arguments - In th:ls type of argument^ data 
are composed of either factual reports or stated opinions. 
The warrant attests to the reliability of these reports or 
opinions. The claim reinforces the data statements whose 
credentials are supplied by the warrant. 

3. Motivational Arguments - The data in this type of argument 
is composed of statements grounded in the claims of previ- 
ous arguments. The warrant provides the rationale for 
accepting the claim by appealing to the some emotive quali- 
ty # ^•^h desire^ in the hearer. 

METHODOLOGY 

In order to validate this classification scheme and the specific 
ordering of the different components of information in the vari- 
oxis types of arguments defined above ^ a descriptive study was 
conducted on a sample of abstracts. . Fifty-five argumentative 
type abstracts were selected from different issues of Interna - 
tional Political Science Abstracts under the headings 'Political 
Thinkers and Ideas' and 'Methods and Theories'. 

Each abstract was classified by the first author (Francis) nto 
one of three major groupings! substantive^ authoritative or moti- 
vational^ based on the Brockriede and Ehninger (1968) classifica- 
tion. The choice was made by locating different kinds of propo- 
sitions i.e. those of value ^ fact or of policy. The abstracts 
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reflecting each type of argument were subdivided into one of the 
six subcategories. Each abstract was examined for presence of 
both the first triad (data^ warrant and claim) and second triad 
(backing^ rebuttal and qualifier) of elements. 

Linguistic Clues 

The second stage of the investigation was an examination of lexi- 
cal clues and verb tenses. In this context ^ clue words are those 
words and phrases which ar^e used by writers to indicate explicit- 
ly how parts of the discourse should be interpreted. For our 
purposes^ it was decided to concentrate on whether some of the 
clues to cohesion specified by Halliday and Hasan (1976) could be 
relied on to reveal which function of an argument was being per- 
formed by the different passages in each abstract. 

Halliday and Hasan have outlined a taxonomy of types of cohesive 
relationships which conjoin one portion of text to another. One 
familiar *:ype of explicitly marked cohesive relationship in texts 
is indicated by markers which relate what follows to what hr.s 
been said before^ e.g. *and*^ *but*^ *soS 'then*. Halliday and 
Hasan (1976) provide an extended discussion of the relationships 
indicated by such markers. For example^ the classes of explicit 
markers for conjunctive relations are: 

• Additive - and^ or^ furthermore^ similarly^ in addition 

• Adversative - but, however^ on the other hand^ nevertheless 

• Causal - so^ consequently^ for this reason^ it follows from 
this 

• Temporal - then^ after that^ an hour later ^ finally^ at last 

A frequency analysis was performed on all words occurring in the 
55 abstracts in an attempt to determine whether there was a 
skewed usage of conjunctive clues within different components of 
the argumentative abstracts. 

RESULTS 

Of the fifty-five abstracts^ 38 (08%) were found to belong to the 
Substantive class, 11 (20%) to the Authoritative, while 6 (11%) 
were classed as Motivational. The Substantive class was sub- 
divided into the six categories detailed above. Substantive/Sign 
arguments occurred most frequently (20/38 times) followed by 
Substantive/Cause 11/38 times. There were no abstracts classed as 
Subs t ant i ve/Analogy . 

Table 1 shows the number of occurrences of the two-level triad of 
elements with the three main classification groups as voll as the 
sub-groups of the Substantive class. The first triad of elements 
(Data, Warrant, Claim) was found in all fifty-five abstracts. In 
the case of second triad. Rebuttal was found in 16 of the 
abstracts, while Qualifier was found in approximately half of the 
abstracts (24). Backing occurred 28 times. 
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Table 1 : Frequency of 2-Level Triad of Elements 



1st Triad 
Data Warrant Claim 



2nd Triad 
Back. Rebut t. Qualif. 



Substantive 
(N = 38) 



Cause 


11 


11 


11 


7 




5 


Sign 


20 


20 


20 


9 


11 


11 


Gene r al i zat ion 


3 


3 


3 


2 




2 


Parallel Case 


3 


3 


3 


3 


1 


2 


Analogy 














Classification 


1 


1 


1 









Authoritative 
(N = 11) 

Motivational 
(N = 6) 



11 
6 
55 



11 
6 
55 



11 
6 
55 



6 
1 

26 



3 
1 

16 



7 
3 
24 



The results showed a reasonably predictable ordering of the 
structural components. Data preceded warrant (40/50) times; war- 
rant preceded backing (33/55) times. An analysis was made of the 
observed orderings of the six components to detect what the most 
frequent order might be. As can be seen in Figure 1, there is no 
invariant ordering, although the most likely structure would be [ 
DATA - WARRANT - BACKING - CLAIM ]. There were no distinctive 
orderings for the three major classes of arguments established by 
Brockriede and Ehninger (1968). That is, substantive arguments, 
authoritative arguments, and motivational arguments do not differ 
significantly one from each other in terms of the ordering of 
argument components. The conjunctive relations suggested by Hal- 
liday and Hasan's work on cohesion were not found to be pre- 
dictive of the particular function in an argument being pleved by 
the pieces of text. 
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Data 



40 



, — ^ Warrant 



8 



33 



Backing 



21 



Claim 



14 




Rebuttal 



■p- Qualifier 



Figure 1: Observed Orderings of Components in 
55 Theoretical Abstracts 
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CONCLUSIONS 

The above presentation of the different types of argument defi- 
nitely suggests that patterns of form and structure can be found 
in argumentation texts. These results strongly support the prem- 
ise which inspired this invest igat ion ^ namely that the rhetorical 
structure of argument is a useful framework in which to describe 
the structure of theoretical abstracts. In addition^ since there 
were no significant differ^^ices between abstracts reflecting dif- 
ferent types of arguments in terms of the orderings of compo- 
nents^ it would appear that a single model will be suitable for 
describing th^ typical orderings of all such abstracts. However # 
the final goal of the re3earchers on the project^ of which this 
study was a part^ was to determine whether these orderings can be 
detected automatically by use of lexical clues ^ In this sense ^ 
the investigation w^s only partially successf il. In order to 
present document representations which usefully identify the role 
that each portion of rext performs in the argument ^ an automatic 
means of detecting these roles need to be identified. Although 
this research did not successfully establish the set of lexical 
clues which could be relied on to automatically structure theo- 
reticfil abstracts^ it did rielineate the culturally validated dis- 
course structure of argumr itative discourse as being the struc- 
ture underlying theoreticax abstracts. 

The importance of this discourse-level structure to information 
retrieval systems is thit it offers the potential of a predicta- 
ble structured repres;:^ntation in which the specific content of 
individual theoretical abstracts might be more easily evaluated 
by users. If an interface could take advantage of a structure 
like that in Figure 2, those users who are not very familiar with 
the topic on which they are seeking information would be aided in 
improving their comprehension of the retrieved documents. Perhaps 
even more importantly^ these structured representations of 
abstracts might be useful to users as a means of refining their 
queries in the next iteration of the search. 
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AUTHOEtTTATIVE 

DATA According to TocquGville^ the most important determi- 

nant of the character of any society is the political 
culture {Moeurs) • 

CLAIM A political culture is shaped not only by sociological 
conditions and Laws^ but also^ in modern times ^ ' by 
ideas propounded by intellectuals. 

WARRANT In Tocqueville's day^ two dominant schools of thought 
were contending for influence over the public miiid of 
Europe t philosopher rationalism and traditional 

BACKIfJG Neither one of these schools^ Tocqueville argued^ pro- 
moted a political culture that could reconcile liberty 
and democracy. 

QUALIFIER Unlike the opposing schools^ the new political science 
could not be propagated directly as an ideology. 

BACKING Its implementation relied on an indexed strategy - 
using institutions to inculcate certain 'mental habits' 
among citizens. This in turn called for ways of limit- 
ing the role of intellectuals in influencing political 
culture. 

Figure 1 : Sample Abstract 
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